Alignment Research, Model Robustness, Adversarial Examples, Risk Assessment
Safe Pruning LoRA: Robust Distance-Guided Pruning for Safety Alignment in Adaptation of LLMs
arxiv.org·10h
TAI #158: The Great Acceleration: AI Revenue, M&A, and Talent Wars Erupt as the Industry Matures
pub.towardsai.net·21h
Defining Corrigible and Useful Goals
lesswrong.com·10h
HW Security: Multi-Agent AI Assistant Leveraging LLMs To Automate Key Stages of SoC Security Verification (U. of Florida)
semiengineering.com·7h
How to use Gemini 2.5 to fine-tune video outputs on Vertex AI
cloud.google.com·22h
Toward Trustworthy AI: A Zero-Trust Framework for Foundational Models
content.knowledgehub.wiley.com·20h
The 20+ most common AI terms explained, simply
threadreaderapp.com·22h
ReMAR-DS: Recalibrated Feature Learning for Metal Artifact Reduction and CT Domain Transformation
arxiv.org·10h
Loading...Loading more...